智能论文笔记

Denoising Diffusion Models for Out-of-Distribution Detection

Mark S. Graham , Walter H. L. Pinaya , Petru-Daniel Tudosiu , Parashkev Nachev , Sebastien Ourselin , M. Jorge Cardoso

分类：机器学习 | 计算机视觉

2022-11-14

Out-of-distribution detection is crucial to the safe deployment of machine learning systems. Currently, the state-of-the-art in unsupervised out-of-distribution detection is dominated by generative-based approaches that make use of estimates of the likelihood or other measurements from a generative model. Reconstruction-based methods offer an alternative approach, in which a measure of reconstruction error is used to determine if a sample is out-of-distribution. However, reconstruction-based approaches are less favoured, as they require careful tuning of the model's information bottleneck - such as the size of the latent dimension - to produce good results. In this work, we exploit the view of denoising diffusion probabilistic models (DDPM) as denoising autoencoders where the bottleneck is controlled externally, by means of the amount of noise applied. We propose to use DDPMs to reconstruct an input that has been noised to a range of noise levels, and use the resulting multi-dimensional reconstruction error to classify out-of-distribution inputs. Our approach outperforms not only reconstruction-based methods, but also state-of-the-art generative-based approaches.

translated by 谷歌翻译

Can segmentation models be trained with fully synthetically generated data?

Virginia Fernandez , Walter Hugo Lopez Pinaya , Pedro Borges , Petru-Daniel Tudosiu , Mark S Graham , Tom Vercauteren , M Jorge Cardoso

分类：计算机视觉

2022-09-17

为了实现良好的性能和概括性，医疗图像分割模型应在具有足够可变性的大量数据集上进行培训。由于道德和治理限制以及与标签数据相关的成本，经常对科学发展进行扼杀，并经过对有限数据的培训和测试。数据增强通常用于人为地增加数据分布的可变性并提高模型的通用性。最近的作品探索了图像合成的深层生成模型，因为这种方法将使有效的无限数据生成多种多样的数据，从而解决了通用性和数据访问问题。但是，许多提出的解决方案限制了用户对生成内容的控制。在这项工作中，我们提出了Brainspade，该模型将基于合成扩散的标签发生器与语义图像发生器结合在一起。我们的模型可以在有或没有感兴趣的病理的情况下产生完全合成的大脑标签，然后产生任意引导样式的相应MRI图像。实验表明，Brainspade合成数据可用于训练分割模型，其性能与在真实数据中训练的模型相当。

translated by 谷歌翻译

Morphology-preserving Autoregressive 3D Generative Modelling of the Brain

Petru-Daniel Tudosiu , Walter Hugo Lopez Pinaya , Mark S. Graham , Pedro Borges , Virginia Fernandez , Dai Yang , Jeremy Appleyard , Guido Novati , Disha Mehra , Mike Vella

分类：计算机视觉 | 机器学习

2022-09-07

可以使用医学成像数据研究人类解剖学，形态和相关疾病。但是，访问医学成像数据受到治理和隐私问题，数据所有权和获取成本的限制，从而限制了我们理解人体的能力。解决此问题的一个可能解决方案是创建能够学习的模型，然后生成以相关性的特定特征（例如，年龄，性别和疾病状态）来生成人体的合成图像。最近，以神经网络形式的深层生成模型已被用于创建自然场景的合成2D图像。尽管如此，数据稀缺性，算法和计算局限性仍阻碍了具有正确解剖形态的高分辨率3D体积成像数据的能力。这项工作提出了一个生成模型，可以缩放以产生人类大脑的解剖学正确，高分辨率和现实的图像，并具有必要的质量，以允许进一步的下游分析。产生潜在无限数据的能力不仅能够对人体解剖学和病理学进行大规模研究，而不会危及患者的隐私，而且还可以在异常检测，模态综合，有限的数据和公平和公平和公平和公平和公平和公平和公平和公平和公平和公平和公平和公平和公平的学习领域进行显着提高。道德AI。代码和训练有素的模型可在以下网址提供：https：//github.com/amigolab/synthanatomy。

translated by 谷歌翻译

Data Science and Machine Learning in Education

Gabriele Benelli , Thomas Y. Chen , Javier Duarte , Matthew Feickert , Matthew Graham , Lindsey Gray , Dan Hackett , Phil Harris , Shih-Chieh Hsu , Gregor Kasieczka

分类：机器学习

2022-07-19

鉴于HEP研究的核心，数据科学（DS）和机器学习（ML）在高能量物理学（HEP）中的作用增长良好和相关。此外，利用物理数据固有的对称性激发了物理信息的ML作为计算机科学研究的充满活力的子场。 HEP研究人员从广泛使用的材料中受益匪浅，可用于教育，培训和劳动力开发。他们还为这些材料做出了贡献，并为DS/ML相关的字段提供软件。物理部门越来越多地在DS，ML和物理学的交集上提供课程，通常使用HEP研究人员开发的课程，并涉及HEP中使用的开放软件和数据。在这份白皮书中，我们探讨了HEP研究与DS/ML教育之间的协同作用，讨论了此交叉路口的机会和挑战，并提出了将是互惠互利的社区活动。

translated by 谷歌翻译

Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models

Walter H. L. Pinaya , Mark S. Graham , Robert Gray , Pedro F Da Costa , Petru-Daniel Tudosiu , Paul Wright , Yee H. Mah , Andrew D. MacKinnon , James T. Teo , Rolf Jager

分类：计算机视觉

2022-06-07

深层生成模型已成为检测数据中任意异常的有前途的工具，并分配了手动标记的必要性。最近，自回旋变压器在医学成像中取得了最先进的性能。但是，这些模型仍然具有一些内在的弱点，例如需要将图像建模为1D序列，在采样过程中误差的积累以及与变压器相关的显着推理时间。去核扩散概率模型是一类非自动回旋生成模型，最近显示出可以在计算机视觉中产生出色的样品（超过生成的对抗网络），并实现与变压器具有竞争力同时具有快速推理时间的对数可能性。扩散模型可以应用于自动编码器学到的潜在表示，使其易于扩展，并适用于高维数据（例如医学图像）的出色候选者。在这里，我们提出了一种基于扩散模型的方法，以检测和分段脑成像中的异常。通过在健康数据上训练模型，然后探索其在马尔可夫链上的扩散和反向步骤，我们可以识别潜在空间中的异常区域，因此可以确定像素空间中的异常情况。我们的扩散模型与一系列具有2D CT和MRI数据的实验相比，具有竞争性能，涉及合成和实际病理病变，推理时间大大减少，从而使它们的用法在临床上可行。

translated by 谷歌翻译

Applications of AI in Astronomy

S. G. Djorgovski , A. A. Mahabal , M. J. Graham , K. Polsterer , A. Krone-Martins

分类：人工智能 | 机器学习

2022-12-03

We provide a brief, and inevitably incomplete overview of the use of Machine Learning (ML) and other AI methods in astronomy, astrophysics, and cosmology. Astronomy entered the big data era with the first digital sky surveys in the early 1990s and the resulting Terascale data sets, which required automating of many data processing and analysis tasks, for example the star-galaxy separation, with billions of feature vectors in hundreds of dimensions. The exponential data growth continued, with the rise of synoptic sky surveys and the Time Domain Astronomy, with the resulting Petascale data streams and the need for a real-time processing, classification, and decision making. A broad variety of classification and clustering methods have been applied for these tasks, and this remains a very active area of research. Over the past decade we have seen an exponential growth of the astronomical literature involving a variety of ML/AI applications of an ever increasing complexity and sophistication. ML and AI are now a standard part of the astronomical toolkit. As the data complexity continues to increase, we anticipate further advances leading towards a collaborative human-AI discovery.

translated by 谷歌翻译

Automatic lesion analysis for increased efficiency in outcome prediction of traumatic brain injury

Margherita Rosnati , Eyal Soreq , Miguel Monteiro , Lucia Li , Neil S. N. Graham , Karl Zimmerman , Carlotta Rossi , Greta Carrara , Guido Bertolini , David J. Sharp

分类：计算机视觉

2022-08-08

对脑外伤（TBI）患者的准确预后很难为治疗，患者管理和长期护理提供信息至关重要。年龄，运动和学生反应性，缺氧和低血压以及计算机断层扫描（CT）的放射学发现等患者特征已被确定为TBI结果预测的重要变量。 CT是临床实践中选择的急性成像方式，因为其获取速度和广泛的可用性。但是，这种方式主要用于定性和半定量评估，例如马歇尔评分系统，该系统容易受到主观性和人为错误。这项工作探讨了使用最先进的，深度学习的TBI病变分割方法从常规获得的医院入院CT扫描中提取的成像生物标志物的预测能力。我们使用病变体积和相应的病变统计作为扩展TBI结果预测模型的输入。我们将我们提出的功能的预测能力与马歇尔分数进行比较，并与经典的TBI生物标志物配对。我们发现，在预测不利的TBI结果时，自动提取的定量CT功能的性能与Marshall分数相似或更好。利用自动地图集对齐，我们还确定额叶外病变是不良预后的重要指标。我们的工作可能有助于更好地理解TBI，并提供有关如何使用自动化神经影像分析来改善TBI后预测的新见解。

translated by 谷歌翻译

Beyond Low Earth Orbit: Biomonitoring, Artificial Intelligence, and Precision Space Health

Ryan T. Scott , Erik L. Antonsen , Lauren M. Sanders , Jaden J. A. Hastings , Seung-min Park , Graham Mackintosh , Robert J. Reynolds , Adrienne L. Hoarfrost , Aenor Sawyer , Casey S. Greene

分类：机器学习

2021-12-22

超越地球轨道的人类空间勘探将涉及大量距离和持续时间的任务。为了有效减轻无数空间健康危害，数据和空间健康系统的范式转移是实现地球独立性的，而不是Earth-Reliance所必需的。有希望在生物学和健康的人工智能和机器学习领域的发展可以解决这些需求。我们提出了一个适当的自主和智能精密空间健康系统，可以监控，汇总和评估生物医学状态;分析和预测个性化不良健康结果;适应并响应新累积的数据;并提供对其船员医务人员的个人深度空间机组人员和迭代决策支持的预防性，可操作和及时的见解。在这里，我们介绍了美国国家航空航天局组织的研讨会的建议摘要，以便在太空生物学和健康中未来的人工智能应用。在未来十年，生物监测技术，生物标志科学，航天器硬件，智能软件和简化的数据管理必须成熟，并编织成精确的空间健康系统，以使人类在深空中茁壮成长。

translated by 谷歌翻译

Beyond Low Earth Orbit: Biological Research, Artificial Intelligence, and Self-Driving Labs

Lauren M. Sanders , Jason H. Yang , Ryan T. Scott , Amina Ann Qutub , Hector Garcia Martin , Daniel C. Berrios , Jaden J. A. Hastings , Jon Rask , Graham Mackintosh , Adrienne L. Hoarfrost

分类：机器学习

2021-12-22

空间生物学研究旨在了解太空飞行对生物的根本影响，制定支持深度空间探索的基础知识，最终生物工程航天器和栖息地稳定植物，农作物，微生物，动物和人类的生态系统，为持续的多行星寿命稳定。要提高这些目标，该领域利用了来自星空和地下模拟研究的实验，平台，数据和模型生物。由于研究扩展到低地球轨道之外，实验和平台必须是最大自主，光，敏捷和智能化，以加快知识发现。在这里，我们介绍了由美国国家航空航天局的人工智能，机器学习和建模应用程序组织的研讨会的建议摘要，这些应用程序为这些空间生物学挑战提供了关键解决方案。在未来十年中，将人工智能融入太空生物学领域将深化天空效应的生物学理解，促进预测性建模和分析，支持最大自主和可重复的实验，并有效地管理星载数据和元数据，所有目标使生活能够在深空中茁壮成长。

translated by 谷歌翻译

FAIR AI Models in High Energy Physics

Javier Duarte , Haoyang Li , Avik Roy , Ruike Zhu , E. A. Huerta , Daniel Diaz , Philip Harris , Raghav Kansal , Daniel S. Katz , Ishaan H. Kavoori

分类：机器学习

2022-12-09

The findable, accessible, interoperable, and reusable (FAIR) data principles have provided a framework for examining, evaluating, and improving how we share data with the aim of facilitating scientific discovery. Efforts have been made to generalize these principles to research software and other digital products. Artificial intelligence (AI) models -- algorithms that have been trained on data rather than explicitly programmed -- are an important target for this because of the ever-increasing pace with which AI is transforming scientific and engineering domains. In this paper, we propose a practical definition of FAIR principles for AI models and create a FAIR AI project template that promotes adherence to these principles. We demonstrate how to implement these principles using a concrete example from experimental high energy physics: a graph neural network for identifying Higgs bosons decaying to bottom quarks. We study the robustness of these FAIR AI models and their portability across hardware architectures and software frameworks, and report new insights on the interpretability of AI predictions by studying the interplay between FAIR datasets and AI models. Enabled by publishing FAIR AI models, these studies pave the way toward reliable and automated AI-driven scientific discovery.

translated by 谷歌翻译